Generalized reaction patterns for prediction of unknown enzymatic reactions.

نویسندگان

  • Yugo Shimizu
  • Masahiro Hattori
  • Susumu Goto
  • Minoru Kanehisa
چکیده

Prediction of unknown enzymatic reactions is useful for understanding biological processes such as reactions to external substances like endocrine disrupters. To create an accurate prediction, we need to define a similarity measure in the reaction. We have developed the KEGG RPAIR database which is a collection of chemical structure transformation patterns, called RDM patterns, for substrate-product pairs of enzymatic reactions. In this study, we compared RDM patterns with EC numbers which are the well-known hierarchical classification scheme for enzymes. Additionally, we performed hierarchical clustering of RDM patterns using the information stating whether each sub-subclass of EC has a particular RDM patterns or not. To represent the variation of RDM patterns in a cluster, we generalized RDM patterns in the same cluster using the hierarchy of KEGG Atomtypes, which are the components of RDM patterns. Using this generalized pattern, we can predict which cluster includes a given RDM pattern even if the reaction of the pattern has not been assigned any EC numbers. Thus we will be able to define the similarity between enzymatic reactions by using this cluster information.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supervised de novo reconstruction of metabolic pathways from metabolome-scale compound sets

MOTIVATION The metabolic pathway is an important biochemical reaction network involving enzymatic reactions among chemical compounds. However, it is assumed that a large number of metabolic pathways remain unknown, and many reactions are still missing even in known pathways. Therefore, the most important challenge in metabolomics is the automated de novo reconstruction of metabolic pathways, wh...

متن کامل

Chemical Continuity of Reaction Centers Along Successive Metabolic Reactions

Cellular functions often come from intricate networks of molecular interactions, which involve not only proteins and nucleic acids but also small chemical compounds. To understand the complexity of life system it is required to know the characteristics of network diagram of those molecules. In this decade, lots of efforts have been made to investigate various types of biological networks includ...

متن کامل

Kerman Health System Workers Knowledge and Attitudes Regarding the Spontaneous Reporting System for Adverse Drug Reactions

         Adverse drug reaction (ADR) is one of the most life threatening problems, and the economic burden of ADR is considerable. The main objective of this study was to assess the attitude of the Kerman health system staff, to evaluate their knowledge of the spontaneous reporting system and to identify the reasons for low reporting rate. In this descriptive study, a Persian translated questio...

متن کامل

Automatic single- and multi-label enzymatic function prediction by machine learning

The number of protein structures in the PDB database has been increasing more than 15-fold since 1999. The creation of computational models predicting enzymatic function is of major importance since such models provide the means to better understand the behavior of newly discovered enzymes when catalyzing chemical reactions. Until now, single-label classification has been widely performed for p...

متن کامل

Metabolome-scale prediction of intermediate compounds in multistep metabolic pathways with a recursive supervised approach

MOTIVATION Metabolic pathway analysis is crucial not only in metabolic engineering but also in rational drug design. However, the biosynthetic/biodegradation pathways are known only for a small portion of metabolites, and a vast amount of pathways remain uncharacterized. Therefore, an important challenge in metabolomics is the de novo reconstruction of potential reaction networks on a metabolom...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genome informatics. International Conference on Genome Informatics

دوره 20  شماره 

صفحات  -

تاریخ انتشار 2008